
Hits 1 – 20 of 81

1	MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
	Khurana, Sameer; Laurent, Antoine; Glass, James
	In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapore, Singapore (2022)
	BASE

2	Simple and Effective Unsupervised Speech Synthesis ...
	Liu, Alexander H.; Lai, Cheng-I Jeff; Hsu, Wei-Ning. - : arXiv, 2022
	BASE

3	Learning Audio-Video Language Representations
	Rouditchenko, Andrew. - : Massachusetts Institute of Technology, 2021
	BASE

4	Cascaded Multilingual Audio-Visual Learning from Videos ...
	Rouditchenko, Andrew; Boggust, Angie; Harwath, David. - : arXiv, 2021
	BASE

5	Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0 ...
	Khurana, Sameer; Laurent, Antoine; Glass, James. - : arXiv, 2021
	BASE

6	Text-Free Image-to-Speech Synthesis Using Learned Segmental Units ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Glass, James; Harwath, David. - : Underline Science Inc., 2021
	BASE

7	Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation? ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Glass, James; He, Tianxing. - : Underline Science Inc., 2021
	BASE

8	Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
	Chuang, Yung-Sung; Gao, Mingye; Luo, Hongyin. - : arXiv, 2021
	BASE

9	Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Chen, Yun-Nung; Chuang, Yung-Sung. - : Underline Science Inc., 2021
	BASE

10	A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
	Khurana, Sameer; Laurent, Antoine; Hsu, Wei-Ning; Chorowski, Jan; Łańcucki, Adrian; Marxer, Ricard; Glass, James
	In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02912029 ; Interspeech 2020, Oct 2020, Shanghai, China (2020)
	Abstract: Probabilistic Latent Variable Models (LVMs) provide an alternative to self-supervised learning approaches for linguistic representation learning from speech. LVMs admit an intuitive probabilistic interpretation where the latent structure shapes the information extracted from the signal. Even though LVMs have recently seen a renewed interest due to the introduction of Variational Autoencoders (VAEs), their use for speech representation learning remains largely unexplored. In this work, we propose Convolutional Deep Markov Model (ConvDMM), a Gaussian state-space model with non-linear emission and transition functions modelled by deep neural networks. This unsupervised model is trained using black box variational inference. A deep convolutional neural network is used as an inference network for structured variational approximation. When trained on a large scale speech dataset (LibriSpeech), ConvDMM produces features that significantly outperform multiple self-supervised feature extracting methods on linear phone classification and recognition on the Wall Street Journal dataset. Furthermore, we found that ConvDMM complements self-supervised methods like Wav2Vec and PASE, improving on the results achieved with any of the methods alone. Lastly, we find that ConvDMM features enable learning better phone recognizers than any other features in an extreme low-resource regime with few labelled training examples.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; Neural Variational Latent Variable Model; Structured Variational Inference; Unsupervised Speech Representation Learning
	URL: https://hal.archives-ouvertes.fr/hal-02912029/file/convDMM_arxiv.pdf https://hal.archives-ouvertes.fr/hal-02912029/document https://hal.archives-ouvertes.fr/hal-02912029
	BASE

11	Similarity Analysis of Contextual Word Representation Models ...
	Wu, John M.; Belinkov, Yonatan; Sajjad, Hassan. - : arXiv, 2020
	BASE

12	CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning ...
	Khurana, Sameer; Laurent, Antoine; Glass, James. - : arXiv, 2020
	BASE

13	A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning ...
	Khurana, Sameer; Laurent, Antoine; Hsu, Wei-Ning. - : arXiv, 2020
	BASE

14	What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context ...
	Baly, Ramy; Karadzhov, Georgi; An, Jisun. - : arXiv, 2020
	BASE

15	Vector-Quantized Autoregressive Predictive Coding ...
	Chung, Yu-An; Tang, Hao; Glass, James. - : arXiv, 2020
	BASE

16	Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies ...
	Liu, Alexander H.; Chung, Yu-An; Glass, James. - : arXiv, 2020
	BASE

17	Improved Speech Representations with Multi-Target Autoregressive Predictive Coding ...
	Chung, Yu-An; Glass, James. - : arXiv, 2020
	BASE

18	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
	Haulcy, R'mani (R'mani Symon); Glass, James R
	In: Frontiers (2020)
	BASE

19	Identification of digital voice biomarkers for cognitive health
	Lin, Honghuang; Karjadi, Cody; Ang, Ting F. A....
	In: Explor Med (2020)
	BASE

20	On the Linguistic Representational Power of Neural Machine Translation Models
	Belinkov, Yonatan; Durrani, Nadir; Dalvi, Fahim...
	In: Computational Linguistics, Vol 46, Iss 1, Pp 1-52 (2020)
	BASE

